Efficient Memory Organization for DNN Hardware Accelerator Implementation on PSoC

نویسندگان

چکیده

The use of deep learning solutions in different disciplines is increasing and their algorithms are computationally expensive most cases. For this reason, numerous hardware accelerators have appeared to compute operations efficiently parallel, achieving higher performance lower latency. These need large amounts data feed each computing layers, which makes it necessary handle the transfers that collect information from accelerators. implementation these accelerators, hybrid devices widely used, an embedded computer, where operating system can be run, a field-programmable gate array (FPGA), accelerator deployed. In work, we present software API organizes memory, preventing reallocating one memory area another, improves native Linux driver with 85% speed-up reduces frame time by 28% real application.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel and Efficient Hardware Implementation of Scalar Point Multiplier

A new and highly efficient architecture for elliptic curve scalar point multiplication is presented. To achieve the maximum architectural and timing improvements we have reorganized and reordered the critical path of the Lopez-Dahab scalar point multiplication architecture such that logic structures are implemented in parallel and operations in the critical path are diverted to noncritical path...

متن کامل

A Cache-Based Hardware Accelerator for Memory Data Movements

T his dissertation presents a hardware accelerator that is able to accelerate large (including non-parallel) memory data movements, in particular memory copies, performed traditionally by the processors. As today’s processors are tied with or have integrated caches with varying sizes (from several kilobytes in hand-held devices to many megabytes in desktop devices or large servers), it is only ...

متن کامل

An Efficient Hardware Accelerator for HS1-SIV Encryption Algorithm

Data security is a major concern for everyone in today’s informational world. Encryption is the process of encoding messages or information in such a way that only authorized parties can read it. It is one of the major information security solutions. Hash Stream1-Synthetic Initialization Vector (HS1-SIV) is a recently developed and fast encryption algorithm. In this paper1, we present a hardwar...

متن کامل

Hardware supported efficient accelerator partitioning for workstation consolidation and virtualization

Accelerators have gained an important role in recent years. While being used primarily in the scientific community in the beginning, they are now employed in a wide range of every day applications. Accelerators can hence be viewed in the focus of machine consolidation and virtualization, offering new opportunities for cost saving and services. Although these opportunities have been discussed in...

متن کامل

Hardware Accelerator Approach Towards Efficient Biometric Cryptosystems for Network Security

Protecting data and its communication is a critical part of the modern network. The science of protecting data, known as cryptography, uses secret keys to encrypt data in a format that is not easily decipherable. However,most commonly secure logons for a workstation connected to a network use passwords to perform user authentication. These passwords are a weak link in the security chain, and ar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2021

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics10010094